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Abstract 

In an earlier paper the author defined iVii, configurations, and stated a conjecture concerning them 
which would lead to an improvement by a constant factor to the sphere-packing bound for linear double 
error correcting codes. Here a computer search is presented, in an effort to gather evidence on the 
conjecture. 

1 Introduction 

In [1], some configurations in binary linear codes, first considered in [3], were further considered. In particular, 
a conjecture was made, on the maximum number of rows of a configuration called an iVi^ configuration. If 
true, the conjecture would yield an improvement by a constant factor to the currently known bound (sphere 
packing bound) on the maximum length n of a double error correcting binary linear code of redundancy r. 

As in [4], the following conventions will be used. Let denote the finite field of order q, for a prime 
power q. A binary code of length n is a subset of the vector space over ■ Such a code is linear if it is a 
subspace; if k is the dimension the redundancy r is defined to be n — k. Codewords are generally denoted v, 
w, etc. Positions are generally denoted i, j, etc., with 1 < i < n. As usual, Vi denotes the element of ^2 in 
position i of V. A vector in '^iU called a bit vector, of length n. Generator matrices will be considered 
to be fc X n, and parity check matrices n x r. A vector v may be identified with its "support" {i : Vi = I}. 
The Hamming weight \v\ of a vector is the cardinality of the support. 

An iVi configuration is defined to be a set S of weight 5 vectors, of minimum distance 6, where there 
is a weight 5 "anchor" vector v and a position i £ v, such that for w G 5, i G w and |w H z«| = 2. An N[ 
configuration is a configuration of triples (weight 3 vectors), which may be obtained from an A^i configuration 
by deleting the positions of v. 

Again as in [4^ a partial linear space is defined to be an incidence matrix (matrix over J-2), where two 
columns are incident to at most one row. Note that the requirement may equally be stated as, two rows 
are incident to at most one column; the requirement is that no "rectangle" of I's occur. The following is 
observed in . 

Theorem 1 An N[ configuration is a partial linear space, of constant row weight 3, together with a partition 
of the rows into 4 or fewer parts, such that in each part the rows are disjoint. 

Note that an A^i configuration from which an N[ configuration arises can be determined from the partition. 
Clearly the maximum number of rows in an Ni configuration of length n (equivalently an N[ configuration 
of length n — 5) is 4[(n — 5)/3j. This bound is achieved in a 3-(n,5,l) design; these exist for n — 4™ + 1 
where m > 1 (see PQ, theorem 6.9). 

An A^i configuration is said to be an A^i^ configuration if the linear span of its rows and the anchor 
vector has minimum weight 5, and an N[j^ configuration is an N[ configuration which arises from an A^i^ 
configuration. Let NiL{n) denote the maximum number of rows in an Nil configuration in a linear double 
error correcting code of length n. The following conjecture was made in [4]. 

Conjecture 1 Ni^^n) is < Ci{n — 5) almost everywhere, for a constant c\ smaller than 4/3. 

It was also shown that the conjecture yields an upper bound on the length of a linear double error 
correcting code of redundancy r, better by a constant factor than the sphere packing bound. A computer 
search showed that for r < 8, the conjecture holds with c\ — 2/3. However, an NiL{n) configuration with 
n — 5 = 60 and 44 rows was found in a cyclic code. 
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In this paper, a computer search is carried out for all N[j^ configurations with n — 5 <= 18 and r <= 14, 
where r is the number of rows. This serves two purposes. First, the existence of Nil configurations for 
small r, n pairs is determined. Second, data is provided for possibly inferring rules for an inductive proof of 
the conjecture. It thus represents an attempt to achieve a goal stated in [5*], that "occasionally a practical 
algorithm and thereby a classification result is obtained." 

We note here that there is a "replication" argument which shows that if a ratio r/ [n — 5) is achieved 
then it is achieved infinitely often. 

Theorem 2 Let M he an incidence matrix whose rows are divided into classes Mi, < i < 3. For m > 1 
let Af™ have m copies of Mi down the diagonal. Let M™ be the matrix whose classes are the M™. 

a. if M is an N[ configuration then Af ™ is; and 

b. if M is an configuration then Af™ is. 

Proof: Part a follows readily using theorem 1 and is left to the reader. For part b, theorem 5b of [1] will 
be used. By a "section" of Af "* we mean the rows or columns of one of the replications. Consider a sum of 
rows of Af". If the weight of the sum is zero in a column section then the corresponding rows can be deleted 
from the sum. If there is more than one column section where the sum is nonzero then the total weight of 
the sum is already at least 6. Otherwise, the requirements of theorem 5b are satisfied in the nonzero section. 

2 Outline of search procedure 

A search for configurations can be carried out inductively, computing the isomorphism classes with r 
rows from those with r — 1 rows. A bound can be imposed on the number of columns c (n — 5 in the preceding 
section), since only configurations with large r / c are of interest. In this paper, the bound on c is 18, and the 
maximum value of r considered is 14 (although sec section 4). 

General procedures for isomorphism testing of incidence matrices exist, including Brendan McKay's 
"Nauty" [Tl], and [8]. For one use of Nauty in coding theory, see [5]. Discussions of using isomorphism 
detection procedures in searching for combinatorial configurations can be found in the literature, for example 
[2] , H J [inj J and [m . One common method is to convert each configuration as it is generated to a "canonical 
representative" of its equivalence class under isomorphism. 

In this paper, specialized methods are used to reduce the matrix to one of several "partial canonical- 
izations". Each such is partitioned into 4 row classes and 15 column classes. Standard methods are then 
used to canonicalize the partitioned partial canonicalizations, and the lexicographically highest such is used. 
Source code may be requested by email from the author, and the description here omits various details. 

An N[ configuration is assumed to be given as an incidence matrix Af , and a partition into 4 or fewer 
parts of the rows. It may be assumed that the rows of a part are consecutive; in some contexts missing parts 
are considered to be empty parts. The parts are also called "row classes" . 

Numbering the parts from to 3, a column may be given a type, namely the function / mapping {0, 1,2,3} 
to {0, 1}, where f{i) = 1 iff the column has a one in part i. The type may be considered as a bit string of 
length 4, and denoted by a hexadecimal digit 0-F (bit being the low order bit). 

Only configurations with no columns of type need be considered. Writing the nonzero types in the 
order F7BDE3596AC1248, a configuration has a "signature", the 15-tuple of natural numbers which in each 
position gives the number of columns of the corresponding type. 

The symmetric group S'4 acts on the row classes, a permutation a G S'4 being considered as "moving" the 
class in position i to position a{i). This induces an action on the column types, namely T 1— > a[T] where T 
is the support (note that, considering T as a characteristic function, T'{p{i)) — T{i) where T' is the image). 
Considering the signature to be a function a from {T} to the natural numbers, a acts on the signatures by 
mapping cr to CTq, where CTQ,(a[T]) — cr{T). 

Given a signature cr, the "canonicalized signature" tT"^ is defined to be the lexicographically greatest 
among the ctq,. As will be seen, it is useful to determine this, and also the right coset Gr of elements of a 
for which aa = cr*^. These can readily be determined by trying all 24 possiblities. Indeed, since signature 
canonicalization is of secondary cost, an efficient implmentation of this method would undoubtedly suffice. 
As will be seen, one refinement was made. 

For canonicalizing the signature it is useful to have a library of routines for computing with permutations 
in S'4. The permutations can be ordered (a recursive order where the first 6 elements are ^3 was used), and 
tables coded which apply a permutation given as an index in the order, by an array reference. 
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>S'4 has 30 subgroups [9|. Not all of them can occur as a stabilizer of a signature, but it is simplest to 
code tables for all of them, in particular a table of elements per subgroup index. There are 234 cosets. A 
coset may be represented as a bit vector of length 24. As noted above, the right coset for a canonicalized 
signature is readily computed along with it. A hash table can be used to obtain the subgroup index and a 
coset representative from the bit vector. 

To speed up the process of canonicalizing the signature, the columns may be grouped according to weight, 
and for each weight, the canonicalized signature and right coset determined successively. For a given weight, 
only permutations in the stabilizer of the higher weight columns need be considered. 

For the weight 3 columns, the weights may be sorted. There are 8 possibilities S1RS2RS3RS4 where R is 
< or = among the sorted sizes; each yields a partition of {1, 2, 3, 4}, and thereby a subgroup of 6*4, consisting 
of the product of the symmetric groups acting on the parts. For the remaining weights, all possibilities within 
the stabilizer so far are tried. It might be possible to acheive a speed up in the case of weight 2 vectors with 
the full group acting, and this is certainly true in the case of weight 1 vectors; but this was omitted. 

The columns of type / for some / will be called a column class. In addition to requiring an incidence 
matrix to have the rows of each row class contiguous, the columns of each column class will be required to 
be also. Further, the column classes are required to be in the order given above. A matrix M with a such 
a row partition consists of 60 = 4 x 15 blocks, one for each row class and column class (some of blocks may 
be empty, i.e., have rows or columns). 

Supposing a method is specified for specifying the canonicalization M'^^ of an N[ configuration M when 
the row classes are fixed, the canonicalization of M'^ of M may be defined as the lexicographically highest 
of the M^^ , where Ma are those matrices obtained from M by permuting the row classes, to yield the 
canonicalized signature (i.e, where a G Gr where Gr is as above. 

When applying a permutation of the row classes, the columns may be permuted in any manner to obey 
the column restriction. In obtaining M'^-'^, only permutations which preserve the blocks need be considered. 

At first, the author intended to write a canonicalization procedure from scratch, under the belief that 
this would be faster and thus more likey to complete. However, Nauty has a reputation for being fast; it 
permits specifying an initial partition, in this case into blocks as above; and a preliminary version using it 
would permit debugging the other code and provide a check. A version using Nauty was thus coded. 

The generation of the configurations proceeds in stages, for r increasing up to some maximum value, 
where the number of columns is limited to some maximum value. At the beginning of the stage for r, the 
r — 1 row configurations are packed in an array; the rest of memory is used for a hash table for the r row 
configurations. At the end of a stage, the hash table is packed down to the beginning of memory. 

Each r — 1 row configuration is unpacked, and the span generated. For each part of its row partition, a 
row is added in every possible way. For each resulting configuration, a check is made whether it is Nil. If 
so its signature is computed and canonicalized, and M'^ is obtained as described above. M'^ is added to the 
hash table if it is not already in it. 

3 Results of search 

Initially the program was run with maximum values of 10 and 15 for r and c. This ran in 21 seconds, so the 
limits were raised to 12 and 18. This run found that configurations with r = 12 and c = 16 exist. The limit 
on r was increased to 14. The time for the run with these values was 228 minutes. 

For this paper, further increases to the limit were omitted, as this would have required additional work. 
For example, the input graph to Nauty has 14+18=32 nodes. If the graph has more than 32 nodes the rows 
of the adjacency matrix no longer fit in a word, and Nauty's execution time would increase. Again, though, 
see section 4. For this paper, with the limits of 14 and 18, the Nauty version is the final version. 

Table 1 shows the number of isomprhism classes of Nil configurations with r rows and c columns, for 
2 < r < 14 and 5 < c < 18. 

From this, the value of ci is larger than 2/3. Indeed, writing Cmin for the smallest c for which configurations 
exist, for even r with 8 < r < 14, Cmin increases by 2 as r does. This suggests that ci is at least 1. 

In [3] the following observations are made. 

• An N[j^ configuration with 2 fiags in each column is a cubic graph. 

• Such a cubic graph must be triangle free. 

• Two of the 6 cubic graphs on 8 vertices are triangle free. 
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Table 1: Isomorphism class counts 



• Among the N[]^ configurations with r = 8 and c = 12, both triangle-free cubic graphs occur. 

From the table, there are 5 configurations with r — 8 and c = 12. It is readily verified that 3 of these are the 
cube, and 2 are the other possible cubic graph. For all 5 configurations the list of partition sizes is 2,2,2,2. 

4 A second search 

Since the "gcc" compiler for the "x86" processor supports a 64 bit "long long" type, a version of Nauty can 
be used where the rows of an adjacency matrix fit in a "long long". A partial search was conducted using 
this feature. Starting from the configurations with r = 13, and c = 17 or c = 18, only extensions by up to 
two columns were considered, and only the minimum two c values for each r used as input to the next stage. 

This search yields an upper bound on Cmin- The results are given in the following table. After r = 19 
the program aborted due to insufficient memory. 
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The results of this search suggest that the status of conjecture 2 is unclear. The ratio r/c increases, but 
so slowly that the computer searches done here don't give any clear indication of its limiting value. For 
example, it is still open whether it can exceed 1. 

It should also be noted that the best known linear double error correcting codes do not contain N[j^ 
configurations with values of r/c as high as those of configurations found here (see (5). 

Topics for further research clearly include the following. 

• An Nil configuration yields a linear double error correcting code containing it, with n = c + 5 and 
k the rank of the configuration, augmented with v. This code may not be very good. A "goodness" 
measure of interest is (l + rt + (2)) /2^n — k). 

• Good codes for a given configuration, and configurations for a given good code, should be more exten- 
sively investigated. 

• Methods for obtaining N configurations from A^i configurations (for example using classical groups) 
might be of interest. 

• Although omitted here, determination of all configurations up to r = 16 and c = 19 can be probably 
be achieved by the methods presented here. 
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